# Document OCR Enhancement
PE Lang G14 448
Apache-2.0
The Perception Encoder is a state-of-the-art image and video understanding encoder trained through vision-language training, with strong generalization capabilities.
Text-to-Image
P
facebook
247
11
Eagle X5 7B
Eagle is a series of vision-centric high-resolution multimodal large language models, supporting input resolutions up to 1K and above, excelling in tasks such as optical character recognition and document understanding.
Image-to-Text
Transformers

E
NVEagle
918
26
Featured Recommended AI Models